An extensible automated protein annotation tool: standardizing input and output using validated XML
نویسندگان
چکیده
MOTIVATION There is a frequent need to apply a large range of local or remote prediction and annotation tools to one or more sequences. We have created a tool able to dispatch one or more sequences to assorted services by defining a consistent XML format for data and annotations. RESULTS By analyzing annotation tools, we have determined that annotations can be described using one or more of the six forms of data: numeric or textual annotation of residues, domains (residue ranges) or whole sequences. With this in mind, XML DTDs have been designed to store the input and output of any server. Plug-in wrappers to a number of services have been written which are called from a master script. The resulting APATML is then formatted for display in HTML. Alternatively further tools may be written to perform post-analysis.
منابع مشابه
Using DiAML and ANVIL for multimodal dialogue annotation
This paper shows how interoperable dialogue act annotations, using the multidimensional annotation scheme and the markup language DiAML of ISO standard 24617-2, can conveniently be obtained using the newly implemented facility in the ANVIL annotation tool to produce XML-based output directly in the DiAML format.
متن کاملResearch Paper: Representing Information in Patient Reports Using Natural Language Processing and the Extensible Markup Language
OBJECTIVE To design a document model that provides reliable and efficient access to clinical information in patient reports for a broad range of clinical applications, and to implement an automated method using natural language processing that maps textual reports to a form consistent with the model. METHODS A document model that encodes structured clinical information in patient reports whil...
متن کاملThe SALSA Annotation Tool
The SALSA annotation tool supports the graphical annotation of a treebank with semantic roles in the frame semantics paradigm. The tool, which takes corpora in the TIGER XML format as input, supports the whole annotation process from subcorpus extraction to merging individual annotations, and allows for underspecified tags as well as tags beyond the sentence boundary and below the word boundary.
متن کاملProsodically Enriched Text Annotation for High Quality Speech Synthesis
Linguistically enriched text generated from natural language modules contributes significantly on the quality of speech synthesis. For all cases where such modules are not available, such enriched input needs to be produced from plain text in order to maintain quality. This work reports on a framework of several combined language resources and procedures (word/sentence identification, syntactic...
متن کاملGuide to Annotation
A review of multimedia annotation techniques, in particular image annotation, is presented. The annotation requirements for the Benchmarking workpackage of the MUSCLE EU Network of Excellence are also presented and discussed. A significant contribution is the creation of a keyword vocabulary based on an analysis of keywords used in experiments for testing automated image annotation algorithms a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Bioinformatics
دوره 22 3 شماره
صفحات -
تاریخ انتشار 2006